# OCR-free Document Understanding
Vietable Donut Docvqa Demo
MIT
A fine-tuned version of the Donut model for Vietnamese document question answering on tabular data.
Question Answering System
Transformers Other

YuukiAsuna
16
1
Docowl2
Apache-2.0
mPLUG-DocOwl2 is an OCR-free multimodal large language model for multi-page document understanding, efficiently encoding document content via a high-resolution document compressor.
Image-to-Text
Safetensors English
mPLUG
482
99
Donut Base Finetuned Rvlcdip
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder to process document images.
Image-to-Text
Transformers

naver-clova-ix
125.36k
13
Donut Proto
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and a text decoder for image-to-text conversion.
Image-to-Text
Transformers

naver-clova-ix
30
7
Donut Base
MIT
Donut is an OCR-free document understanding Transformer model composed of a visual encoder (Swin Transformer) and a text decoder (BART).
Image-to-Text
Transformers

naver-clova-ix
50.34k
207